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1. Introduction 
The field of science has witnessed many great inventions inspired by bionics, i.e the 
application of biological principles to the study and design of human systems (Yu et al., 
2013). The submarine, for example, is an invention that mimics fish. Similarly, evolutionary 
algorithms (EAs) are algorithms that utilize evolutionary principles (survival of the fittest) to 
build adaptive systems in order to solve complex optimization problems that normally cannot 


be solved by deterministic algorithms (Yu et al., 2013). 


Genetic algorithm, a metaheuristic pioneered by John Holland in the 1970s, is perhaps the 
most well-known among different types of evolutionary algorithms such as evolutionary 
programming, evolution strategies and genetic programming (Dasgupta & Michalewicz, 
1997). It has been applied in various fields such as pattern recognition, robotics, artificial life, 
experts system, electronic and electrical field, cellular automata, etc (Dasgupta & 
Michalewicz, 1997). As a part of the larger class of evolutionary algorithms, the genetic 
algorithm also mimics the process of natural selection to solve optimization and search 
problems based on biological operators such as crossover, mutation and selection. A typical 
genetic algorithm consists of the following steps: initialization, evaluation, selection, 
crossover and mutation. Depending on the problems, there are several approaches that can be 


used for each step of the algorithm. 


The Knapsack problem was pioneered by Dantzig in the late 1950s, opening a great number 
of extensive and intensive research later on in this field (Badiru, 1970). The problem 
exemplifies a real-life situation where we have to assign a set of items into a knapsack or a 
number of knapsacks in which each item has different sizes and values while the knapsack 


has a limited capacity. Our goal is to maximize the total value of the items without exceeding 


the capacity(s) of the knapsack(s). The knapsack problem is classified as an NP-hard problem 
whose solutions cannot be obtained by the application of polynomial-time algorithms 
(Badiru, 1970). However, thanks to years of research done by scientists have presented 
several approaches that can be used to easily solve this problem such as dynamic 


programming, recursive approach, greedy algorithm, and genetic algorithm. 


This paper aims to investigate the application of the genetic algorithm to the knapsack 
problem, specifically evaluating the performance of the two different selection strategies 
used: roulette wheel selection and tournament selection with different parameters. The paper 
will also carry out experiments with and without elitism - an algorithm that preserves the best 
individuals to the next generations to observe whether this factor would affect the 


performance of the algorithm in proposing the optimal solution or not. 


This research could be proved helpful in presenting a more optimal approach when utilizing 
the genetic algorithm to solve the knapsack problem. The problem has a plethora of real-life 
applications that reguire computer processing allocations in distributed systems such as 
financial modelling, production and inventory management systems, design of gueuing 
network models in manufacturing and last but not least, control of traffic overload in 
telecommunication systems (Badiru, 1970). With a faster performance in solving the problem 
of traffic overload, for example, it will help to prioritize different data that need to be 
transferred in the network with a scarcely available bandwidth; hence, improving productivity 


and saving lots of money in various fields. 


To evaluate the performance of the two selection strategies used, an experiment would be 


carried out to calculate how many generations it takes for each strategy to find out the 


optimal solution and how close it is compared to the best solution, given that the termination 
condition is the same. The essay would also consider the impact of the crossover rate, 
mutation rate and elitism to see to what extent these variables influence the performance of 


each strategy. 


2. Theory 
2.1. Genetic algorithm 
Genetic algorithm is a heuristic algorithm that is used to solve optimization problems in 
computational mathematics (Pan & Zhang, 2018). The algorithm applies “Darwinian 
principles of survival” to its operation, also known as “the survival of the fittest". The fitter 
individuals will have a higher chance to adapt to the environment and therefore survive and 
reproduce new generations that are more endurable to nature. Similarly, using this principle, 
the genetic algorithm consists of three genetic operations: selection, crossover and mutation 


(Zhong et al., 2006). The process is illustrated by the flowchart below: 


START 


Getting input of each item's weight and value, knapsack capacity, 
crossover rate, mutation rate and the selection strategy 


Generate initial random population of the 
given population size 


Evaluate the fitness and validate the weight of 
each chromosome in the generation 


Is the termination 
condition satisfied? 


No 


If elitism is chosen, choose 2 chromosomes from 
the population with the highest fitness value and 
copy them to the next generation 


Select 2 chromosomes from the 
population to breed 


Perform crossover on the 2 selected 
chromosomes 


Perform mutation on a certain amount of 
chromosomes based on the mutation rate 


Is the termination 
condition satisfied? 


Yes STOP 


Figure 1: Genetic algorithm flowchart 


2.1.1. Exploitation and Exploration 
Exploitation (also called intensification) characterizes the extent to which the algorithm 
preserves the properties of the fittest solutions in the population. An algorithm with a high 
rate of exploitation will move towards the most promising areas of the search space around 
the best solutions found so far (Hao & Solnon, 2019). In figure 2, the population fitness will 
gradually gather in one of the peaks. However, as illustrated in the same figure, there are 
various peaks that can exist in one search space. The tallest peak (both positively and 
negatively) is called the global optima while the lower ones are local optima. Due to this 
factor, high exploitation might lead the population to be stuck in one of the local optima; 


thus, the result found will not be the best solution possible. 
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Figure 2: Illustration of a search space (source: Zachary Kaplan) 


Contrastingly, exploration (also called diversification) highlights the diversity of the 
population. It aims at expanding the search space to discover new areas that may have better 


solutions. By doing this, the algorithm will avoid being stuck in the local optima and hence, 


have a higher chance of reaching the global optima. However, a high rate of exploration 
would lead to a scattered population that cannot converge, which is also not a desired 


outcome that we want to achieve. 


In short, exploitation and exploration play a crucial role in every search algorithm. A 
successful search algorithm reguires a good ratio between exploration and exploitation 
(Crepin&ek et al., 2013). This is achieved by modifying the parameters of the algorithm such 
as crossover rate, mutation rate, population size, etc, which will be investigated in the 


experimenting process. 


2.1.2. Crossover 
Crossover is the process of mixing bits of two chromosomes to create an offspring that has 
the genotype of both parents for the next generation. Firstly, it randomly chooses a locus on 
the chromosome, then it exchanges the subseguences before and after that locus to create the 
offspring (Hristakeva & Shrestha, 2022). Crossover increases the diversity of the population, 


therefore increasing the exploration rate of the algorithm. 


U 
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Figure 3: Illustration for crossover between two chromosomes 
In figure 3, for example, the locus chosen to be crossed over is gene number. After crossover, 


the offspring now has the genes of both parent 1 and parent 2. 


2.1.3. Mutation 
Mutation is a genetic operator that helps to expand the search space, thereby preventing the 
GAs from being stuck in local optima. The mutation of GA in the knapsack problem is a bit 
string mutation in which it flips the bit at random positions of the chromosomes. For 
example, a chromosome with a composition of 1 0 1 0 after mutation at the second bit will be 


1110. 


2.1.4. Premature convergence 
Premature convergence causes loss of diversity, which is a problem that many Evolution 
Computation systems face (Crepin&ek et al., 2013). This phenomenon happens when a few fit 
individuals in the initial population dominate the whole population, preventing the population 
from exploring potentially better individuals (Andre et al., 2000). Since it is very difficult for 
the population to move towards a better solution once converged, the population will be stuck 
in the local optima. Therefore, it is prerequisite that the population needs to be diverse for 


more exploration to avoid this phenomenon. 


2.1.5. Elitism 
Elitism is an algorithm that preserves the first best individuals or the few best individuals (the 
elites) in the generation to the new population (Sharma et al., 2014). This method ensures that 
good solutions are not lost during the breeding process so that the fitness value of the 
upcoming generations will increase. In some cases, elitism can improve the performance of 
the program significantly as it generates a very fit population (Sharma et al., 2014). However, 


one thing to consider when applying elitism in GA is that elitism makes the algorithm 


become much more exploitative. It thus causes premature convergence to happen (Kutubi et 


al., 2018). 


2.1.6. Termination condition 
The algorithm is terminated when either one of these conditions is met: two consecutive 


generations have the same mean fitness value or the limit number of generations is reached. 


22. Selection strategies 
There are several selection strategies such as truncation selection, rank-based selection, 
deterministic sampling, roulette wheel and tournament selection. Fach selection has different 
characteristics. This paper is investigating the roulette wheel and tournament selection since 


they are one of the most well-known selection strategies being used for the genetic algorithm. 


2.2.1. Roulette Wheel Selection 
Roulette wheel selection is the most freguently used selection strategy (Zhong et al., 2006). 
As suggested by the name, this strategy is influenced by the proportional selecting principle 
of the physical roulette wheel. The wheel is divided into sections that correspond to the 
amount of the value of winning, i.e the larger the winning is, the smaller the sector on the 
spin will be. Therefore, when the wheel is spun, the winning probability will be lower. 
Similarly, the roulette wheel selection strategy used in the GA also applies this principle but 
conversely, the higher the fitness, the larger the sector divided on the wheel will be and vice 


versa. 


Roulette Wheel Selection 
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Figure 4: Illustration of the roulette wheel 


The probability of an individual i to be selected in the roulette wheel selection can be 


calculated by the formula below, in which f, is the fitness of i and / is the number of 


individuals in the population: 


For example, given a population of 4 individuals with the consecutive fitness scores of 10, 


20, 30, 40. The probability that the individual number 4 is selected is 


40 


D, 10+20+30+40 — 0.4 


Due to its characteristic, roulette wheel selection always gives a chance for all of the 
individuals in the population, even the weaker ones to be selected. Thus, this trait helps to 


expand the search space, making this selection more explorative. 
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2.2.2. Tournament Selection 
Tournament selection chooses the individuals merely based on their fitness value. As the 
name suggests, the algorithm will first randomly choose a certain number of individuals from 
the population, then it will compare the fitness value between them and finally, choose the 
one with the highest fitness values to breed and reproduce the next generation. Unlike 
roulette wheel selection, there is no arithmetical computation based on the fitness value in 
tournament selection (Zhong et al., 2006). The number of individuals chosen for the 


tournament is called tournament size. 


Although all of the individuals in the population have the same chance to be selected. Since 
the tournament is merely based on comparing the fitness of the individuals, the individuals 
with higher fitness value will have a much higher chance to be selected for the next 


generation, which makes the algorithm become more exploitative. 


2.3. The Knapsack Problem 
The Knapsack Problem is a typical combinatorial optimization problem with more than 40 
years of research (Pan & Zhang, 2018). This problem can be described mathematically as 
follows: given that you have a knapsack of capacity W, which is the maximum weight that 


your knapsack can hold. You have a list of n items, each with a weight of w, and a value of UR 


Our goal is to maximize the value of items that we can bring without exceeding the knapsack 


capacity (Jaszkiewicz, 2002), 


n 
maximise J, Vax, 
i=1 


n 


subject to ); w.x, S W and x € (0; 1) 
i=1 


11 


x, is the number of instances of item i to include in the knapsack. The range of x can only be 


either 0 or 1 because the item can only be left behind or taken. Hence, this is why the 


knapsack problem is also Known as the 0/1 knapsack problem. 


Figure 5: An illustration of the knapsack problem (source: Wikipedia) 


The knapsack problem is an NP-C (Non-deterministic Polynomial Completeness) problem. 
Since there are n items, each with two binary options 0/1, there are 2” possible combinations 


of items, making its computation complexity O(2") (Kellerer et al., 2004). This problem can 
be solved by the classical Brute Force algorithm by trying out all possible solutions; however, 
due to the exponential complexity, this algorithm is only applicable with a small value of n 
(Pan & Zhang, 2018). That is the reason why other non-deterministic algorithms such as 


dynamic programming and genetic algorithm are more effective in solving this problem. 


Figure 6: Binary code for the Knapsack problem with 5 items 


Figure 5 exemplifies a solution for the Knapsack problem with 5 items consecutively marked 
from 1 to 5. The binary option 1 means that the item will be taken and 0 means the item will 


be left out. So in the example of figure 5, 3 items will be taken: item 1, 4 and 5. 


3. Hypothesis 
Prior to my research, there has been numerous research about the performance of the genetic 
algorithm. A research paper done by Jinghui Zhong and the others has found out that 
tournament selection is more effective in convergence than roulette wheel in solving different 


functions (Zhong et al., 2006). This case might also be true to my investigation. 


Given that tournament selection has the time complexity of O (n), while for roulette wheel is 


O(n’) (Sharma et al., 2014), it is certain that the tournament selection is more efficient than 
the roulette wheel selection in terms of time complexity. Combining this feature with the 
higher rate of exploitation, I hypothesize that tournament selection will outperform the 
roulette wheel in terms of convergence rate. However, due to its higher exploitation rate, it is 
likely that the tournament selection might face premature convergence, especially with 
elitism. In that case, the fast pace of converging might compensate for the quality of the 
output solution of the tournament selection. A higher mutation rate and crossover rate might 


increase the diversity of the population, thus improving the quality of the solution. 


Meanwhile, unlike tournament selection, the roulette wheel has a better balance of 
exploration and exploitation. Since it is more explorative, a diverse search space would lead 
to a higher chance of finding the best solution compared to that of tournament selection so the 


output quality of the roulette wheel selection would be better than that of the tournament. 


Nevertheless, due to the process of calculating the fitness proportion, this selection will take a 


longer time to converge and more computational energy than tournament selection. 


4. Methodology 
In this paper, besides literature research, I also used an empirical approach to compare the 
two strategies. This section features the detailed experimental procedures and the variables 


used to determine the results, with reference to the Java code. In order 


4.1. The experimental procedures 
The GA is run several times with three sets of data and several parameter combinations, i.e 
different crossover rates, mutation rates and with/without elitism. For each test, the statistics 
of the GA performance, as well as the graph of the mean fitness by generations are recorded 


for later analysis. 


The detailed procedure is as follow: 

e Find and set up the suitable dataset for the Knapsack Problem. 

e_ Set up the program to insert the data fetched from the given dataset including the 
values and weight of each item in the list, the crossover rate, the mutation rate and the 
strategy used. Because the original code I used only had a general selection strategy 
so I went on to write an implementing code for tournament selection and roulette 
wheel selection. 

e Run the program several times using different inputs of the population size. 

e Record the total generations the process takes, the fitness score and the generation in 
which the individual with the best fitness occurs. 


e_ Synthesize the taken data into tables and graphs. 


4.3. Independent variables 
a. Dataset used 
The experiment was conducted using a dataset directory created by Donald Kreher, Douglas 
Simpson and Silvano Martello, Paolo Toth. The given knapsack has a weight capacity of 750. 
The item list contains 15 objects with different weights and values. As long as the total 
weight of items does not exceed the knapsack's capacity, the subset of the objects is 
considered qualified. The dataset is also given with the most optimal profit and the most 
optimal selection so that we can compare it with the solutions given by the algorithm to 


analyze the efficiency of the algorithm (which is further explained in 4.5) 


Capacity: 750 Optimal profit: 1458 


Figure 7: Dataset for the Knapsack Problem 


b. Other parameters 


Population size 100, 200, 300, 400, 500, 750, 1000 


Tournament size 


Elitism 2 chromosomes with the highest fitness value in the 
population is chosen to be in the next generation. 


Maximum generation | 5000 


43. Dependent variables 
The dependent variable measured in this experiment is the number of generations the 
algorithm takes to find the most optimal solution, the generation in which the best solution 
occurs and the fitness of the best individual in the population at the end of the process when 
the most optimized solution is found or the algorithm reaches its limit population. These 
variables will then be taken to evaluate the optimization speed (measured by the number of 
generations including the initial generation) and the optimization reliability (measured by the 


fitness of the best individual). 


4.4. Controlled variables 


Computer and |MacBook Pro with | Version: 11.5.2 


operating macOS Big Sur Processor: 1,4 GHz Quad-Core Intel Core 15 


system used Memory: 8GB 2133 MHz LPDDR3 


Serial Number: FVFD240PP3Y 1 


Integrated The IDE that the 
Development |program is running 
Environment 


(IDE) used 


Algorithm used | The algorithm used 
in the experiment is 


in Appendix A 


Functions used | Most of the functions 
in the program will 
be the same except 
from the selecting 


function 


4.5. Efficiency measure 


IDE: IntelliJ IDEA CE 2021.3 
Build #IC-213.5744.223 
Runtime version: 11.0.13+7-b1751.19 x86 64 


Java Runtime Environment: 


Java Virtual Machine: OpenJDK 64-Bit 


Server VM by JetBrains s.r.o. 

macOS 11.5.2 

GC: Gl Young Generation, Gl Old 
Generation 

Memory: 1024M 


Cores: 8 


The efficiency of the performance of two selection strategies is measured based on two 


factors: convergence rate (speed) and solution reliability (quality). The variables used to 


measure both of these factors are given in 4.2 Dependent variables. The convergence speed is 
measured by the number of generations it takes to find the most optimized solution. The 
smaller the number of generations is, the more guickly the algorithm converges. Meanwhile, 
its reliability is measured by the difference between the fitness of the solution found with the 
most optimized solution that is already known beforehand in the dataset (given in 4.2.a). 
S. Experiment Results and Analysis 
5.1. Experiment 1 
e Crossover rate: 0.85 
e Mutation rate: 0.01 


e Without elitism 


Population size 


100 
200 


500 945 572 


Since this is the first experiment, not much conclusion can be withdrawn from it. However, 
we can see that tournament selection converges faster since it takes fewer generations to 
converge (as shown in the “No. of generations" column) compared to roulette wheel, except 
for the last run with the population size of 1000. This might be due to the big size of the 


population but in most other cases, it is certain that the tournament selection is faster in terms 


of converging rate. Regarding the fitness score, since GA is non-deterministic, the fitness 
score varies differently for each run and it is not certain to conclude the difference between 


the two strategies. 


5.2. Experiment 2 
e Crossover rate: 0.85 
e Mutation rate: 0.01 


e With elitism 


Population size No. of Best Best fitness No. of Best 
generations | generation score generations | generation 


Compared to the first experiment, there is a clear difference in both strategies when elitism is 
added. Starting with roulette wheel, the converging rate has significantly improved as it takes 
much fewer generations to converge. It seems like without elitism, roulette wheel usually 
misses the potential best solution as it is not guaranteed to be selected for the next 
generations. Therefore, it takes a longer time to converge although the best solution is usually 
found in the early generations. For tournament selection, the best solution is mostly found 
within the first generation (generation 0) while the fitness score of the solution found is not 


the best one (we already know the best solution has the fitness value of 1458) regardless of 


the population size. Therefore, it is likely that premature has happened to tournament 


selection as elitism makes the algorithm become much more exploitative. 


5.3. Experiment 3 
e Crossover rate: 0.95 
e Mutation rate: 0.01 


e Without elitism 


No. of 


For experiments 3 and 4, the crossover rate is increased to see its impact. For roulette wheel 
selection, the probability that the algorithm found the most optimized solution increased 
significantly, especially with the big population size. An expansion of the search space might 
explain this result due to the increased crossover rate, however, the population seems to never 
converge but rather scatter around as the algorithm only stopped when it reaches the 
maximum generations. For tournament, similar to the first two experiments, it still takes less 


time to converge than roulette wheel. 
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5.4. Experiment 4 
e Crossover rate: 0.95 
e Mutation rate: 0.01 


e With elitism 


Population size No. of Best Best fitness No. of Best Best fitness 
generations | generation score generations | generation score 


7 
13 
36 
16 
56 


12 
15 

3 

2 
32 
59 
36 


| we x 3 


When adding in elitism, once again premature occurs in the tournament selection. Hence, the 
increased crossover rate does not seem to have any influence on tournament in this case. 
However, this phenomenon surprisingly has also occurred with roulette wheel. Compared to 
experiment 2 where elitism was also applied, in this experiment, roulette wheel also takes 
significantly less generation to converge, the only difference is that the probability that it 


reaches the global maximum is much less than that of experiment 2. 
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5.5. Experiment 5 
e Crossover rate: 0.85 
e Mutation rate: 0.03 


e Without elitism 


Population size No. of Best Best fitness No. of Best Best fitness 
generations | generation score generations | generation score 

100 286 100 1449 1446 

200 357 171 1451 445 1448 


In experiment 5 and 6, the mutation rate is increased. This experiment interestingly witnessed 
a significant increase in the quality of the solutions found in both selection strategies. 
Especially in tournament selection, from the population size of 400 to 1000, the best fitness 
score is always found. Despite the similar quality of the solution found, tournament selection 
still surpasses the roulette wheel in terms of converging rate with a lower number of 


generations (less run time). 


22 


5.6. Experiment 6 
e Crossover rate: 0.85 
e Mutation rate: 0.03 


e With elitism 


generations | generation score generations | generation score 


400 156| 0| 1450| 219] 0 1446 


This experiment has the highest exploitation rate as mutation rate is increased and elitism is 
applied. Both selection strategies found their best solution in only the first three generations 
and none of them found the best optimal score. Therefore, it can be concluded that premature 


has appeared in both selection strategies, making both of them stuck at the local maxima. 


6. Conclusion 
The experiment has shown that the tournament selection converges much faster as it takes 
fewer generations to output the solution. It is also more exploitative than roulette wheel 
selection since the best solutions are usually found in early generations. Moreover, my 
hypothesis is proven to be correct: due to its high rate of exploitation, premature convergence 


has occurred when elitism is applied in tournament selection. 
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On the other hand, experiment 5 shows that an increase in the exploration rate can 
significantly improve the output of tournament selection. It means that with a balance of 
exploration and exploitation, tournament selection has a better performance with a fast 
converging rate and higher guality of solutions found. Meanwhile, the roulette wheel except 
for experiments 2 and 6 where premature convergence occurred, in most other cases, roulette 
wheel gives better solutions than tournament. The only problem with the roulette wheel is 


that it takes a longer time to converge without elitism. 


In short, the convergence rate of tournament selection is better than roulette wheel in most 
cases regardless of the configurations. The guality of the solution is not always assured and is 
varied with different configurations. However, an increase in mutation rate could fix this 
problem and improve the performance of the tournament selection. With that being said, 
tournament selection can be much more effective compared to roulette wheel, specifically 


with bigger optimization problems. 
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Appendices 


Appendix A - Genetic Algorithm for Knapsack Problem 


A.1: KnapsackProblem.java (Matthew Mayo, with modification from the student) 
y x* 

* Gfilename: KnapsackProblem. java 

* Gauthor: Matthew Mayo 

* Gmodified: 2014-04-08 

* Gdescription: Creates a KnapsackProblem object based on user input, 
* attempts to solve using a genetic algorithm; outputs 

* algorithm data step-by-step, generates list of optimal 

* items for problem, graphs mean fitness by generation; 

* optional command line argument output filename will 

* redirect all algorithm details output to output filename 

* in current directory, will overwrite output filename 

* contents if file exists 


* Gusage: java KnapsackProblem <output filename> 


my 
import java.io.Console; 
import java.io.File; 


import java.io.FileOutputStream; 


import java.io.FileNotFoundException; 
import java.io.PrintStream; 


import java.lang.StringBuilder; 


import java.util.*; 


public class KnapsackProblem { 


private boolean verbose = false; 


private boolean mutation - false; 


private int selectionOper = 0; 


private int crossover count = 0; 
private int clone count = 0; 
private int number of items = 0; 


private int elitismOper = 0; 


private int population size = 0; 
private int maximum generations = 0; 
//private int generation counter = 1; 
private int tournament size = 0; 
private double knapsack capacity = 0; 


private double prob crossover = 0; 


private double prob mutation = 0; 


private double total fitness of generation = 0; 

//private ArrayList<Double> tournament = new ArrayList<Double>(); 
private ArrayList<Double> value of items = new ArrayList<Double>(); 
private ArrayList<Double> weight of items - new ArrayList<Double>(); 


private ArrayList<Double> fitness = new ArrayList<Double>(); 


private ArrayList<Double> total weight of solution- new 
ArrayList<Double>(); 

private ArrayList<Double> best fitness of generation = new 
ArrayList<Double>(); 

private ArrayList<Double> second best fitness of generation = new 
ArrayList<Double>(); 

private ArrayList<Double> mean fitness of generation = new 
ArrayList<Double>(); 

private ArrayList<String> population - new ArrayList<String>(); 


private ArrayList<String> breed population = new ArrayList<String>(); 


private ArrayList<String> best solution of generation = new 


ArrayList<String>(); 


private ArrayList<String> second best solution of generation = new 


ArrayList<String>(); 


/** 
* Main method 
A 


public static void main(String[] args) ( 


// Check for command line argument output filename 


// If filename present, redirect all System.out to file 


try 
File file name - new File(args[0]); 
if (file name.exists()) ( 


file name.delete(); 


) 


FileOutputStream fos - new FileOutputStream(file name, true); 
PrintStream ps - new PrintStream(fos); 


System.setOut (ps); 


} catch (FileNotFoundException e) { 


System.out.println ("Problem with output file"); 


// Construct KnapsackProblem instance and pass control 


KnapsackProblem knap = new KnapsackProblem(); 


// Construct graph of mean fitness by generation 
SimpleGraph graph = new 


SimpleGraph (knap.mean fitness of generation, 
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"Mean Fitness by Generation"); 


/** 
* Default constructor 
*/ 


public KnapsackProblem() ( 


// Get user input 


this.getInput(); 


// Make first generation 


this.buildKnapsackProblem(); 


// Output summary 


this.showOptimalList(); 


/** 
* Controls knapsack problem logic and creates first generation 
Ay: 


public void buildKnapsackProblem() { 


// Generate initial random population (first generation) 


this.makePopulation(); 


// Start printing out summary 


System.out.println("\nInitial Generation:"); 


System.out.println(" "y; 
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System.out.println("Population:"); 


for (int i = 0; i < this.population size; i++) ( 


System.out.println((i + 1) + "- " + this.population.get (i)); 


// Evaluate fitness of initial population members 


this.evalPopulation(); 


// Output fitness summary 


System.out.println ("\nFitness:"); 
for (int i = 0; i < this.population size; i++) ( 
System.out.println((i + 1) + "- " + this.fitness.get(i)); 


// Find best solution of generation 


this.best solution of generation.add(this.population.get (this.getBestS 


olution())); 


// Find second best solution of generation 


this.second best solution of generation.add(this.population.get(this.g 


etSecondBestSolution())); 


// Output best solution of generation 


System.out.println("\nBest solution of initial generation: " + 


this.best solution of generation.get(0)); 


// Find mean solution of generation 


this.mean fitness of generation.add(this.getMeanFitness()); 
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// Output mean solution of generation 


System.out.println ("Mean fitness of initial generation: " + 


this.mean fitness of generation.get (0)); 


// Compute fitness of best solution of generation 


this.best fitness of generation.add(this.evalGene (this.population.get (this.ge 


tBestSolution()))); 


// Compute fitness of second best solution of generation 


this.second best fitness of generation.add(this.evalGene(this.population.get( 


this.getSecondBestSolution()))); 


// Output best fitness of generation 


System.out.println("Fitness score of best solution of initial 


generation: " + this.best fitness of generation.get (0)); 


// If maximum generations » 1, breed further generations 


if (this.maximum generations » 1) ( 


makeFurtherGenerations(); 


/** 
* Makes further generations beyond first, if necessary 
xy 


private void makeFurtherGenerations() ( 


// Breeding loops maximum generation number of times at most 
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for (int i = 1; i < this.maximum generations; i++) ( 


// Check for stopping criterion 


if ((this.maximum generations > 3) && (i > 4)) ( 


// Previous 2 generational fitness values 
double a = this.mean fitness of generation.get(i - 1); 


double b 


this.mean fitness of generation.get(i - 2); 


double c 


this.mean fitness of generation.get(i - 3); 


// If both 3 are equal, stop 

if (a == & b == c) ( 
System.out.println("NnStop criterion met"); 
maximum generations = i; 


break; 


/**The threshold is met 
if (this.total weight of solution.get(i) == 
knapsack capacity && this.fitness.get(i) > 


this.best fitness of generation.get(i)) 


f 
System.out.println("|nStop criterion met"); 
maximum generations = i; 
break; 

) 

#7 


// Reset some counters 


this.crossover count = 0; 
this.clone count = 0; 


this.mutation = false; 


// Breed population 
if(elitismOper == 0) { 
for (int j = 0; j < this.population size / 2; j++) { 
this.breedPopulation (i); 
} 
} else if(elitismOper == 1) ( 


for(int j = 0; j < ((this.population size / 2) - 1); j++) 


this.breedPopulation (i); 


// Clear fitness values of previous generation 


this. fitness.clear(); 


// Evaluate fitness of breed population members 


this.evalBreedPopulation (); 


// Copy breed population to population 
for (int k = 0; k < this.population size; k++) { 


this.population.set(k, this.breed population.get(k)); 


// Output population 
System.out.println("NnGeneration " + (i + 1) + ":"); 


if ((1 + 1) < 10) { 


System.out.println(" "ys 
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if ((i + 1) >= 10) { 


System.out.println(" "y; 


) 


if ((i + 1) >= 100) ( 


System.out.println(" "); 


) 
System.out.println("Population:"); 


for (int 1 = 0; 1< this.population size; l++) { 


System.out.println((l + 1) +" - " + this.population.get(1)); 


// Output fitness summary 
System.out.println("\nFitness:"); 


for (int m = 0; m < this.population size; m++) ( 


System.out.println((m + 1) +" - " + this.fitness.get (m)); 


// Clear breed population 


this.breed population.clear(); 


// Find best solution of generation 


this.best solution of generation.add(this.population.get (this.getBestSolution 


0); 


// Find second best solution of generation 


this.second best solution of generation.add(this.population.get(this.getSecon 


dBestSolution())); 


// Output best solution of generation 
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System.out.println("NnBest solution of generation "+ (i + 1) +": 


" + this.best solution of generation.get(i)); 


// Find mean solution of generation 


this.mean fitness of generation.add(this.getMeanFitness ()); 


// Output mean solution of generation 
System.out.println("Mean fitness of generation: " + 


this.mean fitness of generation.get(i)); 


// Compute fitness of best solution of generation 


this.best fitness of generation.add(this.evalGene (this.population.get (this.ge 


tBestSolution()))); 


// Compute fitness of second best solution of generation 


this.second best fitness of generation.add(this.evalGene (this.population.get( 


this.getSecondBestSolution()))); 


// Output best fitness of generation 
System.out.println("Fitness score of best solution of generation " 


+ (1 + 1) + ": " + this.best fitness of generation.get(i)); 


// Output crossover/cloning summary 

System.out.println("Crossover occurred " + this.crossover count + 
" times"); 

System.out.println("Cloning occurred " + this.clone count + " 
times"); 

if (this.mutation == false) ( 


System.out.println("Mutation did not occur"); 
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) 
if (this.mutation == true) { 


System.out.println("Mutation did occur"); 


private void stopCriterion() { 
SimpleGraph graph = new SimpleGraph (mean fitness of generation, 


"Mean Fitness by Generation"); 


/** 
* Output KnapsackProblem summary 
293 


private void showOptimalList() { 


// Output optimal list of items 


System.out.println("ânOptimal list of items to include in knapsack: 


double best fitness = 0; 


int best_gen = 0; 


// First, find best solution out of generational bests 
for (int z = 0; z < this.maximum generations - 1; z++) { 
if (this.best fitness of generation.get(z) > best fitness) { 
best fitness = this.best fitness of generation.get(z); 


best gen = z; 
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System.out.println ("Best generation is " + best gen); 


System.out.println("Best fitness is " + best fitness); 


// Then, go through that's generation's best solution and output items 
String optimal list = this.best solution of generation.get (best gen); 
for (int y = 0; y < this.number of items; yrr) { 

if (optimal list.substring(y, y + 1) .equals("1")) { 


System.out.print((y+ 1) +" "); 


System.out.println(); 


for (int i = 0; i < maximum generations - 1 ; i++) { 
System.out.println("Generation:" + "Nt" + i + "Nt" + "Fitness:" + 
"Nt" + this.best fitness of generation.get(i)); 


} 


/** 


* Breeds current population to create a new generation's population 


E 


private void breedPopulation(int i) ( 


// 2 genes for breeding 


int[] genes; 
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// If population size is odd f, use elitism to clone best solution of 


previous generation 
if (elitismOper == 1) { 


breed population.add (best solution of generation.get (i-1)); 


breed population.add (second best solution of generation.get (i-1)); 


} 


// Increase generation counter 


//generation counter = generation counter + 1; 


// Get positions of pair of genes for breeding 


genes = select (); 


// Crossover or cloning 


crossoverGenes(genes[0], genes[1]); 


public int[] select () { 


int[] gene = new int[2]; 


if (selectionOper == 1) { 
gene[0] = selectGeneTournament (); 
gene[1] = selectGeneTournament (); 
} else ( 


gene[0] = selectGeneRouletteWheel(); 


gene[1] = selectGeneRouletteWheel(); 


return gene; 
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/** 
* Performs mutation, if necessary 
x 


private void mutateGene() ( 


// Decide if mutation is to be used 
double rand mutation - Math.random(); 


if (rand mutation «- prob mutation) ( 


// If so, perform mutation 
mutation = true; 

String mut gene; 

String new mut gene; 

Random generator = new Random(); 
int mut_point = 0; 


double which gene = Math.random() * 100; 


// Mutate gene 

if (which gene <= 50) ( 
mut gene = breed population.get (breed population.size() - 1); 
mut point = generator.nextInt (number of items); 


if (mut_gene.substring (mut_point, mut point + 1).equals("1")) 


new mut gene = mut gene.substring(0, mut point) + "O" + 
mut gene.substring(mut point); 
breed population.set(breed population.size() - 1, 


new mut gene); 
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if (mut gene.substring (mut point, mut point + 1).equals("0")) 


new mut gene = mut gene.substring(0, mut point) + "1" + 


mut gene.substring(mut point); 


breed population.set(breed population.size() - 1, 


new mut gene); 


) 

if (which gene > 50) ( 
mut gene = breed population.get (breed population.size() - 2); 
mut point = generator.nextInt (number of items); 


if (mut gene.substring (mut point, mut point + 1).equals("1")) 


new mut gene = mut gene.substring(0, mut point) + "0" + 


mut gene.substring(mut point); 


breed population.set(breed population.size() - 1, 


new mut gene); 


) 


if (mut gene.substring (mut point, mut point + 1).equals("0")) 


new mut gene = mut gene.substring(0, mut point) + "1" + 


mut gene.substring(mut point); 


breed population.set(breed population.size() - 2, 


new_mut gene); 


/** 
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* Selects a gene for breeding 

* 

* @return int - position of gene in population ArrayList to use for 
breeding 

#/ 


private int selectGeneRouletteWheel () { 


// Generate random number between 0 and total fitness_of generation 


double rand = Math.random() * total fitness_of generation; 


// Use random number to select gene based on fitness level 


for (int i = 0; i < population size; i++) { 
if (rand <= fitness.get(i)) ( 
return i; 


) 


rand = rand - fitness.get(i); 


// Not reachable; default return value 


return 0; 


/** 
* Tournament selection 
* Written by the student 
* Greturn 


ua 


private int selectGeneTournament() { 


//Array of genes selected for the tournament 


double[][] tournament = new double[tournament size][2]; 


for (int j = 0; j < tournament size; j++) ( 
// Generate random position within the range of 0-population size 
Random r = new Random(); 
int rand = r.nextInt (population size); 


tournament[j][0] = rand; 


// fill in the tournament array with the position and the fitness 
value 
for (int i = 0; i < tournament size; i++) ( 
// Select random genes from the population 


tournament[i][1] = fitness.get((int) tournament[i] [0]); 


//Select the best individual 
double temp = tournament[0][1]; 
int gene position = 0; 
for (int n = 0; n < tournament size; n++ł) ( 
if (temp <= tournament[n][1]) { 
temp = tournament [n] [1]; 


gene position = (int) tournament[n] [0]; 


return gene position; 
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/** 
* Performs either crossover or cloning 
RZ 


private void crossoverGenes (int gene 1, int gene 2) { 


// Strings to hold new genes 
String new gene 1; 


String new gene 2; 


// Decide if crossover is to be used 
double rand crossover - Math.random(); 
if (rand crossover «- prob crossover) ( 
// Perform crossover 
crossover count = crossover count + 1; 
Random generator = new Random(); 


int cross point = generator.nextInt (number of items) + 1; 


// Cross genes at random spot in strings 
new gene_l = population.get (gene 1) .substring(0, cross point) 
population.get (gene 2) .substring (cross point); 


new gene 2 = population.get (gene _ 2) .substring (0, cross point) 


population.get (gene 1) .substring (cross point); 


// Add new genes to breed population 
breed population.add(new gene 1); 
breed population.add(new gene 2); 
} else ( 
// Otherwise, perform cloning 
clone count = clone count + 1; 
breed population.add (population.get (gene 1)); 


breed population.add(population.get(gene 2)); 


+ 
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// Check if mutation is to be performed 


mutateGene(); 


/** 
* Gets best solution in population 


* 


* Greturn int - position of best solution in population 


= 
private int getBestSolution() ( 
int best position = 0; 
double this fitness = 0; 
double best fitness = 0; 
for (int i = 0; i < population size; itr) { 
this fitness = evalGene(population.get(i)); 
if (this fitness > best fitness) ( 
best fitness = this fitness; 
best position = i; 
) 
) 
return best position; 
) 
/** 


* Gets second best solution in population 


* Greturn int - position of second best solution in 


RZ 


population 
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private int getSecondBestSolution() ( 
int second best position = 0; 
int best_ position = 0; 
double this fitness = 0; 


double best fitness = evalGene (population.get(0)); 


double second best fitness = evalGene (population.get (0)); 
for(int i = 1; i < population size;i++) 
( 

this fitness = evalGene(population.get(i)); 


if (this fitness > best fitness) ( 
second best fitness = best fitness; 
second best position = best position; 
best fitness = this fitness; 
best position = i; 
} 
if (this_fitness > second best_ fitness && this_fitness 
best_fitness) { 
second best_fitness = this_fitness; 


second best_position = i; 


} 


return second best position; 


/** 
* Gets mean fitness of generation 
294 

private double getMeanFitness() { 


double total fitness = 0; 
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double mean fitness = 0; 


for (int i = 0; i < population size; i++) { 


total fitness = total fitness + fitness.get(i); 
mean fitness = total fitness / population size; 


return mean fitness; 


/** 
* Evaluates entire population's fitness, by filling fitness ArrayList 
* with fitness value of each corresponding population member gene 
*/ 


private void evalPopulation() ( 


total fitness of generation = 0; 


for (int i = 0; i < population size; i++) { 


double temp fitness = evalGene (population.get (i)); 

fitness.add (temp fitness); 

total fitness of generation = total fitness of generation + 
temp fitness; 


) 


/** 


* Evaluates entire breed population's fitness, by filling breed fitness 


ArrayList 
* with fitness value of each corresponding breed population member gene 
234 
private void evalBreedPopulation() { 


total fitness of generation = 0; 


48 


for (int i = 0; i < population size; i++) { 


double temp fitness = evalGene(breed population.get(i)); 

fitness.add(temp fitness); 

total fitness of generation = total fitness of generation + 
temp fitness; 


) 


/** 
* Evaluates a single gene's fitness, by calculating the total weight 


* of items selected by the gene 


* Greturn double - gene's total fitness value 
uA 
private double evalGene(String gene) ( 
double total weight = 0; 
double total value = 0; 


double fitness value = 0; 


double difference = 0; 


// Get total weight associated with items selected by this gene 
for (int j = 0; j < number of items; j++) { 
c = gene.charAt (j); 
// If chromosome is a '1', add corresponding item position's 
// weight to total weight 
TE c(oo-— SE) 
total weight = total weight + weight of items.get (j); 


total value = total value + value of items.get(j); 
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) 


// Check if gene's total weight is less than knapsack capacity 


difference = knapsack capacity - total weight; 

if (difference >= 0) ( 
// This is acceptable; calculate a fitness_value 
// Otherwise, fitness_value remains 0 (default), since knapsack 
// cannot hold all items selected by gene 
// Fitness value is simply total value of acceptable permutation, 
// and for unacceptable permutation is set to '0' 
fitness_value - total value; 


this.total weight of solution.add(total weight); 


// Return fitness value 


return fitness_value; 


/** 
* Makes a population by filling population ArrayList with strings of 


* length number of items, each element a gene of randomly generated 


* chromosomes (1s and 0s) 
RZ 
private void makePopulation() ( 


for (int i = 0; i < population size; i++) { 


// Calls makeGene() once for each element position 


population.add (makeGene ()); 
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/** 


* Generates a single gene, a random String of 1s and Os 


* 


* (return String - a randomly generated gen 


A 


private String makeGene() ( 


// Stringbuilder builds gene, one chromosome (1 or 0) at a time 


StringBuilder gene = new StringBuilder (number of items); 


// Each chromosome 


char c; 


// Loop creating gene 
for (int i = 0; i < number of items; i++) { 
e = ON, 
double rnd = Math.random(); 
// If random number is greater than 0.5, chromosome is '1' 
// If random number is less than 0.5, chromosome is 'O' 
if (rnd > 0.5) ( 
c= '1'; 
) 
// Append chromosome to gene 
gene.append(c); 
) 
// Stringbuilder object to string; return 


return gene.toString(); 


/** 
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* Collects user to be used as parameters for knapsack problem 
uf 


private void getInput () { 


try { 
File myObj = new File ("dataset.txt"); 
Scanner s = new Scanner (myObj); 


while (s.hasNext ()) { 


/** 
* // Population size 


T population size = s.nextInt(); 


*/ 


//Selection Strategy 


selectionOper = s.nextInt(); 


//Elitism 


elitismOper = s.nextInt(); 


// Tournament size 


tournament size = s.nextInt(); 


// Maximum number of generations 


maximum generations = s.nextInt (); 


// Crossover probability 


prob crossover = s.nextDouble(); 


// Mutation rate 


prob mutation = s.nextDouble(); 
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// Number of items 


number of items = s.nextInt (); 


// Value of each item 
for (int i = 0; i < number of items; itr) { 


value of items.add(s.nextDouble()); 


// Weight of each item 
for (int i = 0; i < number of items; i++) { 


weight of items.add(s.nextDouble()); 


// Capacity of knapsack 


knapsack capacity = s.nextInt(); 


s.close(); 


// Hold user input, line by line 


String input; 


// Initialize console for user input 


Console c = System.console(); 


if 


// 


input = c.readLine ("Enter the population size: "); 


If 


(c == null) ( 


System.err.println("No console."); 


System.exit(l); 


Population size 


(isInteger(input)) ( 
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population size - Integer.parseInt(input); 
) else ( 
System.out.println("Not a number. Please try again."); 


System.exit(l); 


} catch (FileNotFoundException e) { 
System.out.println("An error occurred."); 


e.printStackTrace(); 


/** 
* Determines if input string can be converted to integer 
* 
* (param String - string to be checked 
* (return boolean - whether or not string can be converted 
Ky 
public static boolean isInteger(String str) ( 
EÉEy d 


Integer.parseInt (str); 


) catch (NumberFormatException e) ( 


return false; 


return true; 


/** 
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* Determines if input string can be converted to double 


* @param String - string to be checked 
* @return boolean - whether or not string can be converted 
#/ 


public static boolean isDouble (String str) { 


try { 


Double.parseDouble (str); 


} catch (NumberFormatException e) { 


return false; 


return true; 


} // KnapsackProblem 


Roulette Wheel Selection (written by the student) 


private int selectGeneRouletteWheel () { 


// Generate random number between 0 and total fitness of generation 


double rand = Math.random() * total fitness of generation; 


// Use random number to select gene based on fitness level 


for (int i = 0; i < population size; i++) { 
if (rand <= fitness.get(i)) { 
return i; 


} 
rand = rand - fitness.get (i); 
} 
// Not reachable; default return value 


return 0; 


Tournament Selection (written by the student) 


private int selectGeneTournament() ( 


//Array of genes selected for the tournament 


double[][] tournament - new double[tournament size][2]; 


for (int j = 0; J < tournament size; j++) ( 
// Generate random position within the range of 0-population size 


Random r = new Random(); 


int rand = r.nextInt (population size); 


tournament[j] [0] = rand; 


// fill in the tournament array with the position and the fitness value 
for (int i = 0; i < tournament size; irr) ( 
// Select random genes from the population 
tournament[i][1] = fitness.get((int) tournament[i][0]); 
) 
//Select the best individual 
double temp = tournament [0] [1]; 
int gene position = 0; 
for (int n = 0; n < tournament size; n++) { 
if (temp <= tournament[n][1]) { 
temp = tournament [n] [1]; 


gene position = (int) tournament [n] [0]; 


} 


return gene position; 


A.2: SimpleGraph.java 


[** 


* @filename: 


* @author: 


* @modified: 


* @description: 


appropriate 


* @usage: 


* @note: 


SimpleGraph.java 
Matthew Mayo 


2014-04-08 


Creates a SimpleGraph object based on supplied ArrayList 


of data points; draws graph, adds points, lines, 


hatch marks; must supply ArrayList of data points to 


and title of graph to display 


java SimpleGraph <data_points> <graph_title> 


Inspiration for, and adapted code, comes from: 


http://stackoverflow.com/guestions/8693342/drawing-a-simple-line-graph-in-j 


ava 
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import java.awt.BasicStroke; 
import java.awt.Color; 

import java.awt.Dimension; 
import java.awt.FontMetrics; 
import java.awt.Graphics; 
import java.awt.Graphics2D; 
import java.awt.Point; 

import java.awt.RenderingHints; 
import java.awt.Stroke; 

import java.util.ArrayList; 
import javax.swing.JFrame; 
import javax.swing.JPanel; 
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public class SimpleGraph extends JPanel ( 


private int width = 800; 


private int heigth = 400; 


180); 


200); 


private int padding = 25; 

private int label padding = 25; 

private int point width = 6; 

private int number y divisions = 0; 

private Color line color - new Color(44, 102, 230, 

private Color point color = Color.BLACK; 

private Color grid color = new Color(200, 200, 200, 

private static final Stroke GRAPH STROKE = new BasicStroke(2f); 
private String graph title = ""; 


private ArrayList<Double> data points; 


/** 


* Main method (for testing directly from this class) 


*/ 


public static void main(String[] args) ( 


// Create an ArrayList<Double> of data points 


ArrayList<Double> test data = new ArrayList<Double>(); 


// Add points to data points 
test data.add(1.0); 
test data.add(9.2); 


test data.add(5.7); 


test data.add(7.9); 


58 


test data.add(2.4); 


test data.add(11.5); 


// Set a graph title 


String test title = "Graph title goes here"; 


// Pass data points and graph title to SimpleGraph constructor 


SimpleGraph test = new SimpleGraph(test data, test title); 


/** 
* Default constructor 
d 


public SimpleGraph(ArrayList«Double» data points, String graph title) ( 


// Set data points data set and graph title 


this.data points data points; 


this.graph title = graph title; 


// Set number of y divisions by fidning difference between 


// max and min data points 


number y divisions = getMaxDataPoint() - getMinDataPoint(); 


// Set preferred size of panel 


this.setPreferredSize(new Dimension(800, 600)); 


// Create content frame, add to panel 


JFrame frame - new JFrame(graph title); 


frame.setDefaultCloseOperation(JFrame.EXIT ON CLOSI 


IE] 
— 
`~ 


frame.getContentPane().add(this); 


frame.pack(); 


frame.setLocationRelativeTo(null); 


frame.setVisible(true); 


/** 
* Creates and draws graph to specification 
* Gparam Graphics - What to be drawn 
x 

@Override 


public void paintComponent (Graphics g) { 


super.paintComponent (g) ; 
Graphics2D g2 = (Graphics2D) g; 


g2.setRenderingHint (RenderingHints.KEY ANTIALIASING, 


T 


RenderingHints.VALUE ANTIALIAS ON); 


// Set scales 


double xScale = ((double) getWidth() - (2 * padding) - label padding) 
/ (data points.size() - 1); 

double yScale = ((double) getHeight() - 2 * padding - label padding) 
/ (getMaxDataPoint() - getMinDataPoint()); 


// Create array of Point objects from passed in array of Doubles 
ArrayList<Point> graphPoints = new ArrayList<>(); 


for (int i = 0; i < data points.size(); i++) ( 


int xl (int) (i * xScale + padding + label padding); 


int yl = (int) ((getMaxDataPoint() - data points.get(i)) 


* yScale 
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+ padding); 


graphPoints.add(new Point(xl, yl)); 


// Draw white background 


g2.setColor(Color.WHITE); 


g2.fillRect(padding + label padding, padding, getWidth() - (2 * 
padding) 


- label padding, getHeight() - 2 * padding - label padding); 


g2.setColor(Color.BLACK); 


// Create hatch marks and grid lines for y axis 
for (int i = 0; i < number y divisions + 1; i++) ( 
if(number y divisions == 0) { 
number y divisions = number y divisions + 1; 
) 


int x0 = padding + label padding; 


int x1 = point width + padding + label padding; 


int y0 getHeight() - ((i * (getHeight() - padding * 2 
- label padding)) / number y divisions + padding + 
label padding); 
int yl = y0; 
if (data points.size() > O) ( 


g2.setColor(grid color); 


g2.drawLine(padding + label padding + 1 + point width, y0, 


getWidth() - padding, yl); 


g2.setColor(Color.BLACK); 


String yLabel = ((int) (getMinDataPoint() + (getMaxDataPoint () 
- getMinDataPoint()) * 
((i * 1.0) / number y divisions))) + " e 


FontMetrics metrics = g2.getFontMetrics(); 


61 


int labelWidth = metrics.stringWidth (yLabel); 


g2.drawString(yLabel, x0 - labelWidth - 5, y0 
+ (metrics.getHeight() / 2) - 3); 
) 


g2.drawLine(x0, y0, xl, yl); 


// Create hatch marks and grid lines for x axis 
for (int i = 0; i < data points.size(); i++) ( 


if (data points.size() > 1) ( 


int x0 = i * (getWidth() - padding * 2 - label padding) 
/ (data points.size() - 1) + padding + label padding; 
int xl = x0; 
int y0 - getHeight() - padding - label padding; 
int yl = y0 - point width; 


p 
Eh 
p 
oo 


((int) ((data points.size() / 20.0)) + 1)) == 0) { 
g2.setColor(grid color); 


g2.drawLine(x0, getHeight() - padding - label padding - 1 


- point width, xl, padding); 


g2.setColor (Color.BLACK); 


String xLabel = (i + 1) + ""; 
FontMetrics metrics = g2.getFontMetrics(); 


int labelWidth = metrics.stringWidth(xLabel); 


g2.drawString(xLabel, x0 - labelWidth / 2, y0 
+ metrics.getHeight() + 3); 
) 


g2.drawLine(x0, y0, xl, yl); 


// Create x and y axes 
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g2.drawLine(padding + label padding, getHeight() - padding - 
label padding, 


padding + label padding, padding); 


g2.drawLine(padding + label padding, getHeight() - padding - 


label padding, 


getWidth() - padding, getHeight() - padding - label padding); 


// Draw lines 
Stroke oldStroke - g2.getStroke(); 
g2.setColor(line color); 


g2.setStroke (GRAPH STROK 


Gl 


); 


for (int i 0; i < graphPoints.size() - 1; i++) ( 
int xl = graphPoints.get(i).x; 


int yl = graphPoints.get(i).y; 


int x2 = graphPoints.get(i + 1).x; 


int y2 = graphPoints.get(i + 1) .y; 


g2.drawLine(xl, yl, x2, y2); 


// Draw points 


g2.setStroke(oldStroke); 


g2.setColor(point color); 


for (int i = 0; i < graphPoints.size(); i++) { 


int x graphPoints.get (i).x - point width / 2; 


int y graphPoints.get(i).y - point width / 2; 


int ovalW = point width; 


int ovalH = point width; 


g2.fillOval(x, y, ovalW, ovalH); 


/** 
* Returns minimum data point in data points set 


* @return int - Minimum data point in set 


n 
private int getMinDataPoint() ( 
int min data point - Integer.MAX VALUE; 
Integer dp conv - 0; 
for (Double data point : data points) ( 
dp conv - (int) data point.doubleValue(); 
min data point = Math.min(min data point, dp conv); 
) 
return min data point; 
) 
/** 


* Returns maximum data point in data points set 
* Greturn int - Maximum data point in set 
ry 


private int getMaxDataPoint() { 


Gl 


int max data point = Integer.MIN VALU 
Integer dp conv - 0; 
for (Double data point : data points) ( 

dp conv = (int) data point.doubleValue() + 1; 

max data point = Math.max (max data point, dp conv); 
) 


return max data point; 
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) // SimpleGraph 


Appendix B - Sample of Input text 


B dataset.txt 


Appendix C - License to use the code 


https://github.com/mmmayo13/knapsack-problem-ga-java/blob/master/LICENSE 


The MIT License 
(MIT) 


Copyright (c) 2014 Matthew Mayo 


Permission is hereby granted, free of charge, to any person 
obtaining a copy 

of this software and associated documentation files (the 
"Software"), to deal 

in the Software without restriction, including without 
limitation the rights 

to use, copy, modify, merge, publish, distribute, sublicense, 


and/or sell 


copies of the Software, and to permit persons to whom the 


Software is 
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furnished to do so, subject to the following conditions: 


The above copyright notice and this permission notice shall 


be included in all 


copies or substantial portions of the Software. 


THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY 
KIND, EXPRESS OR 

IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF 
MERCHANTABILITY, 

FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO 
EVENT SHALL THE 

AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES 
OR OTHER 

LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR 
OTHERWISE, ARISING FROM, 

OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER 
DEALINGS IN THE 

SOFTWARE. 
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